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Abstract — Distributed storage systems need to store data re- 
dundantly in order to provide some fault-tolerance and guarantee 
system reliability. Different coding techniques have been proposed 
to provide the required redundancy more efficiently than tra- 
ditional replication schemes. However, compared to replication, 
coding techniques are less efficient for repairing lost redundancy, 
as they require retrieval of larger amounts of data from larger 
subsets of storage nodes. To mitigate these problems, several 
recent works have presented locally repairable codes designed to 
minimize the repair traffic and the number of nodes involved per 
repair. Unfortunately, existing methods often lead to codes where 
there is only one subset of nodes able to repair a piece of lost data, 
limiting the local repairability to the availability of the nodes in 
this subset. 

In this paper, we present a new family of locally repairable codes 
that allows different trade-offs between the number of contacted 
nodes per repair, and the number of different subsets of nodes that 
; enable this repair. We show that slightly increasing the number 
of contacted nodes per repair allows to have repair alternatives, 
which in turn increases the probability of being able to perform 
efficient repairs. 

Finally, we present pg-BLRC, an explicit construction of locally 
'repairable codes with multiple repair alternatives, constructed 
from partial geometries, in particular from Generalized Quad- 
rangles. We show how these codes can achieve practical lengths 
and high rates, while requiring a small number of nodes per repair, 
and providing multiple repair alternatives. 

I. Introduction 

In recent years, distributed storage systems as used in large 
data centers have started to incorporate coding techniques 
to redundantly store data across different storage nodes. For 
example, Facebook reported that it is archiving old data using a 
classic Reed-Solomon code implemented on top of the Hadoop 
Distributed File System (HDFS) [j], (2), and Microsoft uses a 
Pyramid Code as the main storage primitive of its Azure storage 
service |3|. The use of coding mechanisms in these distributed 
storage systems provides significantly higher fault-tolerance 
values and lower storage overheads than simpler replication 
schemes HI. For example, in systems like HDFS or Azure, 
coding techniques allow to store data with a footprint of 1.3- 
1.5 times the size of the original object, which represents a 
footprint reduction of 50% as compared to the de-facto standard 
3-replica scheme. 

The main problem of using traditional coding techniques 
in distributed storage systems is that repairing lost encoded 
data requires the retrieval of large amounts of data from large 
subsets of nodes, which entails an important network traffic 
and lots of input/output (I/O) operations. In today's large 
distributed storage systems where node failures are the norm 
rather than the exception, minimizing the communication costs 



due to data repairs has therefore become an important problem. 
Regenerating Codes [5] were the first families of a new wave of 
codes especially designed to minimize repair costs in distributed 
storage systems. Regenerating Codes address the repair problem 
by describing an optimal trade-off between the storage overhead 
of the code and its repair communication costs. However, repair 
processes in Regenerating Codes require contacting a large 
subset of nodes, which complicates the design of the storage 
system and increases the number of required I/O operations. 

A different line of new codes, called locally repairable codes 
(LRC), also addresses the repair problem, but focusing on 
reducing the number of nodes contacted during repair |[6|-|[TT[, 
while still guaranteeing a low repair traffic. However, the main 
problem with existing locally repairable codes is that although 
they reduce the size of the subset of contacted nodes, they suffer 
from the drawback that only a single subset of nodes enables 
the repair of a specific piece of redundant data. If a single 
node from this repair subset is not available, data cannot be 
repaired "locally", increasing the cost of the repair. To solve this 
problem, some codes like Pyramid codes fl2) , or Hierarchical 
codes [13 1, provide different subsets of nodes that enable the 
repair of each piece of redundant data. However, these different 
repair subsets do not have all the same size: during normal 
operations, lost data can be repaired using the smallest subset, 
however, when the number of unavailable nodes grows, repairs 
require the use of larger subsets, thus increasing the repair cost. 

In order to maximize the reliability of storage systems it 
is therefore desirable to obtain codes where lost data can be 
repaired by contacting a small number of nodes r, where this 
number can be as small as r = 2. In addition, since the repair 
process might not be able to contact some of these r nodes 
during repair (e.g., due to temporary node unavailabilities, or 
even a correlated failure of multiple nodes), it is also desirable 
to have a > 1 different alternative r-subsets of nodes enabling 
the repair of any lost data. Unfortunately, although some locally 
repairable codes |8|, [9| present constructions where a > 1, to 
the best of our knowledge there is no publication focusing on 
the analysis and design of codes that allow a trade-off between 
the values of a and r. 

Contributions: 

In this paper, we present a new framework to facilitate the 
analysis and design of locally repairable codes with different 
trade-offs between their repair locality and their number of 
repair alternatives per failure. This framework allows us to 
define the "local repair tolerance" as a new metric that measures 
the maximum number of nodes that can be unavailable in the 



system without compromising the ability to locally repair all 
the stored data. Furthermore, we present pg-BLRC codes, an 
explicit construction of locally repairable codes with multiple 
repair alternatives, constructed from partial geometries. We pro- 
vide an upper and lower bound for the rate of such codes, which 
is attained by the class of Generalized Quadrangles. Design of 
efficient pg-BLRC codes involves a trade-off among three of 
its desirable features: (i) small repair locality, (ii) large number 
repair alternatives, and (iii) low storage footprint, identified 
through numerical evaluations. Focusing on optimizing any of 
these properties leads to a bad performance of the other two. 
Requiring multiple repair alternatives thus introduces a new 
point of view in the study of the optimal trade-offs between 
the rate and the repair locality of storage codes. 

II. Linear Codes for Distributed Storage 

Let q be a prime power. A q-ary linear code C of length n and 
rank k is a fc-dimensional linear subspace of the vector space 
F™, where ¥ q is the finite field with q elements and n > k. 
As a linear subspace, the code C can be defined by a k x n 
full-rank generator matrix G as 

C={oG:oeFj}, 

where the vectors c <E C are called the codewords of C. The 
code C can alternatively be defined using an (n — k) x n full- 
rank parity check matrix H such that GH T = 0, as 

C = {c £ F ? ; : Hc T = 0} . 

In distributed storage systems, a data object of k symbols is 
represented as a vector o £ Fj, containing fc[log 2 g] bits. To 
redundantly store this object across different storage nodes, the 
system first obtains the codeword c = oG £ F™, and then it 
stores the n symbols of c on n different storage nodes. Since 
n > k, each stored object requires a disk capacity larger than 
its original size. The rate R = k/n of the code represents 
the proportion of storage capacity that is used to store non- 
redundant symbols. The closer R is to one the more storage- 
efficient the code is. The efficiency of codes is also measured 
in terms of the storage footprint, n/k, which is the capacity 
used to store each data object compared to its original size. 

After storing the codeword c the system can reconstruct the 
original object o by gathering some k symbols out of the n 
stored ones. For that, let X, \X\ — k, be a set containing 
the indexes of these k symbols. Then the object o can be 
reconstructed by solving the system o = cxGj 1 , where cx 
is the vector composed of the elements of c that are indexed 
by the members of X, and similarly, Gx is the submatrix of 
G composed of the columns of G that are also indexed by the 
members of X. It is important to note that the previous system 
can only be solved if the matrix Gx is invertible. When this 
matrix is invertible for any fc-subset X we say that the code is 
a maximum distance separable code (or MDS code). 

III. Data Repairability in Linear Codes 

As pointed out in the introduction, providing efficient mech- 
anisms to repair lost encoded data is an important problem 
in distributed storage systems. In this section we introduce an 
analytic framework to evaluate the repairability properties of 
different linear codes, focusing on codes providing local repairs. 



Let the code C be the dual code of C, the code generated 
by the parity check matrix H. Then, by definition all codewords 
v £ C are parity check vectors of C, which means that vc T = 

for any c £ C. This parity check property can be used to 
construct repair mechanisms able to repair lost symbols in the 
codewords of C. For that, let v £ C be a parity check vector 
with a nonzero ith symbol, that is v(z) ^ 0. Then, repairing the 
zth symbol of c, c(i), consists of solving the equation vc T = 0, 
or equivalently, solving 

v(l)c(l) + v(2)c(2) + ■ ■ • + v(n)c(n) = 0. 

This equation has then as many unknowns as number of 
nonzero symbols in v, or w(v) unknowns, where w is the 
Hamming weight function. Then, repairing a missing symbol 
of c requires to retrieve w(v) — 1 other symbols and solve the 
equation for c(i). However, retrieving w(v) — 1 symbols over a 
communication network might entail a significant overhead in 
terms of network traffic. Consequently, to minimize this traffic 
it is important to design codes C guaranteeing that for every 

1 £ [n] = {1, 2, . . . , n} there exists at least one vector v £ C 1 - 
with v(i) 7^ 0, and small Hamming weight w(v). 

To analyze the repair efficiency of codes we can enumerate 
all the possible ways in which the zth symbol of a codeword 
c G C can be repaired. To that end, we define f2(z) as the set 
containing all the parity check vectors repairing this symbol: 

n(i) = {v £ C 1 - : v(i) ^ 0} . 

Then, for each i £ [n], we can evaluate the cost of repairing 
the zth symbol by analyzing the Hamming weight of all vectors 
v £ The repair degree is a metric that describes the 

number of symbols that need to be retrieved per repair: 

Definition 1 (Repair Degree). We define the repair degree for 
the ith codeword symbol as r(i) — min {w(v) — 1 : v <= f2(i)}, 
and the overall repair degree r of a linear code is its maximum 
repair degree: r — max {r(i)}™ =1 . 

In classic MDS codes such as Reed-Solomon codes fl4) , or 
in the newest Regenerating Codes [5], we have that the repair 
degree is at least equal to the rank of the code, r > k. For 
Reed-Solomon codes it means that repairing a single failure 
requires to transfer an amount of information equal to the size of 
the original object, or /c|~log 2 q] bits. For Regenerating Codes, 
although they still have to contact at least k nodes per repair, 
the overall amount of data transferred per repair can be slightly 
reduced below the size of the original object. However, instead 
of aiming at reducing the repair traffic, in this paper we are 
interested in codes able to improve the repair performance by 
reducing the number of nodes that need to be contacted per 
repair below k, which in turn leads to reduce the network traffic 
per repair as well. We will refer to those codes with an overall 
repair degree much smaller than its rank (r <C k) as locally 
repairable codes, or LRC. 

IV. Codes with Multiple Repair Alternatives 

Even though locally repairable codes significantly reduce the 
number of nodes that need to be contacted during repairs, it 
is often also desirable to guarantee the existence of multiple 
subsets of this kind. This is especially important in storage 
systems where some of the nodes to be contacted might be 
unavailable, either because they are temporary busy, or because 



there was a correlated failure affecting several nodes. In this 
section we measure the multiple repair alternatives of LRC 
codes and their tolerance to locally repair missing data in the 
presence of node unavailabilities. 

Let Q, r (i) be the subset of containing the vectors that 
allow to repair c(i), c G C, with a repair degree at most r, 
i.e., O r (i) = {v G f2(i) : w(v) < r + 1}. Each of these vectors 
represents then a possible alternative to repair the ith symbol 
of any codeword c G C. 

Definition 2 (Repair Alternativity). The repair alternativity of 
the ith codeword symbol is the number of distinct subsets of 
nodes with at most r nodes, which contain enough information 
to repair tlie ith symbol. The repair alternativity of i is then 
a(i) — \fl r (i)\, and the code's overall repair alternativity is 
a = min{a(i)}" =1 . 

Non-locally repairable MDS codes such as Reed-Solomon 
codes and Regenerating Codes have a large repair alternativity 
of a — ("), which guarantees that all stored symbols can 
be repaired even when a large portion of storage nodes is 
unavailable. In fact, all symbols can be repaired as long as 
the stored information is available (that is, if k symbols are 
available). On the other hand, most of the existing locally 
repairable codes |6), (7) have a local repair alternativity of 
a = 1. In this case, if any of the r nodes involved in the repair 
is temporary unavailable, the code cannot use the local repair 
mechanisms, requiring then more expensive repair solutions. 
To the best of our knowledge SRC JS), (9) are the only LRC 
codes with a repair alternativity larger than one, a > 1. 
Unfortunately, in SRC codes the value of a depends on the 
code construction, which does not allow to obtain codes with 
arbitrary a values. Moreover, obtaining SRC codes with large 
a leads to unpractical codes with low rate R. 

In this paper, we focus on the design of locally repairable 
codes with arbitrary repair alternativity and practical rates. 
Having codes with multiple repair alternatives increases the 
probability to be able to locally repair lost data when some 
nodes are unavailable. To maximize the local repair probability, 
it is therefore important to guarantee that the number of com- 
mon nodes in different repair alternatives for a given symbol is 
as small as possible. For example, in order to maximize the 
number of repair alternativities of the ith symbol, a(i), we 
have to minimize | supp(v) n supp(u)|, for any distinct pair 
of vectors v.u G 0, r (i), where supp(v) is the support of v, 
which is the set containing the indexes of the non-zero positions 
of v. Using this concept, we can formally define the local repair 
tolerance of a LRC code as follows: 

Definition 3. The local repair tolerance of the ith symbol, 8(i), 
is the size of the smallest set of coordinates different than i that 
intersects with the support of all codewords in Q r (i): 

S(i)=mhx{\l\ :lc[n]\{i}, Jn supp(v) ^ 0, VveO r (i)}. 

The overall local repair tolerance of the code is 
8 = min{5(i)}" =1 . 

This means that the ith symbol can be locally repaired 
when at most 5(i) nodes are unavailable, and any symbol is 
locally repairable when at most 8 nodes are unavailable. In 
the design of locally repairable codes we will aim then to 
maximize the value of 8. However, a code design process that 



only focuses on the maximization of 8 might end up with 
unbalanced codes where S(i) <C 8(j) for different symbols 
i, j G [n], increasing the probability of being able to locally 
repair some symbols with respect to others. To avoid this issue 
one can try to smooth out the difference between the repair 
tolerance of different symbols. In general we are interested in 
balanced locally repairable codes: 

Definition 4 (Balanced Codes). When 8(i) — 8(j) for all 
i,j G [n], i 7^ j, we say that the code is balanced. 

We will refer to these balanced locally repairable codes as 
BLRC codes, and we will use the notation (n, k, r, a, 5)-code 
to refer to a code of length n and rank k, where each symbol 
of the codeword can be repaired from at most r other symbols, 
having at least a different sets of symbols that guarantee such 
a repair, and being able to locally repair each symbol if there 
are at most 8 unavailable symbols. In the next section we will 
present a simple code construction of BLRC code with arbitrary 
a and r values. 

V. Balanced Locally Repairable Codes 
from Partial Geometries 

We present a way to generate explicit BLRC constructions 
from partial geometries and analytically evaluate the new codes 
in terms of repair tolerance 8 and code rate R. We first provide 
a brief description of partial geometries. 

A. Partial Geometries 

A partial geometry pg(s,t,a) is an incidence structure be- 
tween a set of points V and a set of lines B such that: 

1) Each point P G V is incident with t + 1 lines (t > 1). 

2) Each line B G B is incident with s + 1 points (s > 1). 

3) Any two lines have at most one point in common. 

4) If a point P and a line B are not incident, there are 
exactly a (a > 1) pairs (Q, M) £VxB, such that P is 
incident with M and Q is incident with B. 

It follows that 1 < a < min {t + 1, s + 1}, and by definition 
of partial geometries the cardinalities of the point and line sets 
must satisfy: 

= (s + l)(st + a) = (t + l)(st + a) 

a a 

The dual of a partial geometry, which is the incidence 
structure that arises from interchanging the set of points V with 
the set of lines B, is also a partial geometry with parameters 
pg(s' — t,t' — s,a). Finally, according to the values of the 
parameters s, t and a, partial geometries can be divided into 
four classes: 

1) When a = s+1, or dually a = t + 1, the partial geometry 
is a Steiner 2-design. 

2) When a = s, or dually a = t, the partial geometry is 
called a net or a transversal design. 

3) When a = 1, the partial geometry is called a generalized 
quadrangle. 

4) For 1 < a < min{s, t} the partial geometry is proper. 

As we will show in Section |V-C| generalized quadrangles are 
of special interest to design optimal codes in terms of rate. 



B. Codes from Partial Geometries 

Partial geometries and other incidence structures have been 
widely studied for the construction of LPDC codes ]T5) , p6[ . 
The incidence matrix of partial geometries can be used as 
a simple mechanism to obtain sparse parity-check matrices 
with low rank over Fa, which makes them particularly suitable 
to construct high rate LPDC codes with efficient iterative 
decoders. The similarity of the requirements of these LPDC 
codes with those of BLRC codes makes incidence matrices of 
partial geometries a promising source of BLRC designs. We 
will use some of the results of Johnson and Weller fl6) to 
evaluate the local repairability of such codes. Although in the 
previous sections we considered generic q-ary codes, in this 
section we limit our code designs to binary codes, i.e., q = 2. 
As we will show, this limitation does not effect the rate of the 
obtained code. 

Let us define the incidence matrix of a partial geometry 
pg(s,t,a) as a \B\ x \V\ matrix N = (ny), where mj = 1 
or according as whether the ith line is incident with the 
jth point or not. Constructing a linear code C from a partial 
geometry consists then of building an m x n parity check 
matrix H containing m linear independent rows of N, where 
m = rank 2 (N). Then we can obtain the generator matrix 
G of the code C by obtaining the row-echelon form of H, 
H = Q], for example using the Gauss-elimination 

method. Since GH T = 0, the generator matrix is then defined 
as G = [— Q T \h}- Note that the code C defined by the 
generator matrix G contains an identity matrix and is then a 
systematic code. We will denote a code C constructed from a 
partial geometry pg(s,t,a), as a pg-BLRC code. 

Besides the formal definition of pg-BLRC codes, we can also 
state the following lemma regarding the repair degree and the 
repair alternativity of such codes. 

Lemma 1. The repair degree of a pg-BLRC code C and its 
repair alternativity satisfies r < s, and a > t + 1. 

Proof: From the properties of partial geometries we have 
that every point P E V is incident with t + 1 lines, and each 
of these lines is at the same time incident with s + 1 points. 
It means that for each i E [n] there are t + 1 rows of the 
incidence matrix N, namely v ...v t , such that Vj(i) = 1 
and w(vj) = s + 1 for all j E [n]. Then, by definition of 
f2 r (z) we have that Vj E f2 r (z), for all j = 0, . . . , t, and hence 
|O r (i)| > t + 1. Then, from Definition [2] we get that a > t + 1, 
and similarly, from Definition [T] we get that r(i) < s for all 
i E [n] and thus r < s. ■ 
For the rest of the paper we will call (r, a) pg-BLRC codes 
those codes constructed from partial geometries pg(s,t,a), 
where s > 1, When s = 1 any lost data can be repaired by 
contacting a single node in the system, which corresponds to 
a simple data replication scheme. The condition s > 1 allows 
us to exclude replication schemes from the definition of our 
BLRC codes. 

Lemma 2. A pg-BLRC code with s > 1 has a per-symbol 
repair tolerance bounded by 5(i) > t + 1, for all i E [n], and 
an overall repair tolerance bounded by S > t + 1. 

Proof: Let Af(i) C fl r (i) be the set containing all the rows 
v of the incidence matrix N satisfying that v(i) = 1, for all 
i E [n]. One the one hand, let us first assume thatTV(i) = fi r (i). 



In this case, from property 3) of partial geometries we have that 
supp(v) l~l {j} = for all v E Af(i), j E [n] \ {i}, and then 
from Definition [3] it follows that S(i) = a(i) > t + 1 and 
6 = a > t + 1. On the other hand, if we assume Af(i) C fi r (i), 
then the extra vectors w E Q, r (i) \AT(i) might only contribute 
to increase the repair tolerance of the symbol i. ■ 
From the last two lemmas we know that a (r, a) pg-BLRC 
code C constructed from a pg(s,t,a) partial geometry guar- 
antees that r < s, a > t + 1 and S > t + 1. Not achieving 
these three bounds with equality implies that there exists some 
codewords in the dual code C with a Hamming weight smaller 
than s + 1. Due to the difficulty of finding such type of 
codewords, we will refer to a (r, a) pg-BLRC code as code with 
a designed repair degree r, and a designed repair alternativity 
a. This gives to the designer of the storage system the guarantee 
to be able to repair all missing symbols by contacting r other 
nodes, having a alternative r-subsets that enable this repair (the 
lines of the geometry), and the guarantee that the system can 
repair any failure when at most 6 = a nodes are temporary 
unavailable. 

C. Rate Bounds of pg-BLRC Codes 

In the previous sections we have presented the construction 
of pg-BLRC codes and the properties that allow to measure the 
repair performance of these codes. However, besides offering 
efficient repair mechanisms, codes used in distributed storage 
systems also need to guarantee low storage overheads (or 
equivalently high code rates). In this section we provide an 
upper and lower bound for the rate R of (r, a) pg-BLRC codes. 

Theorem 1. The rate R of an (r, a) pg-BLRC code C is lower 
bounded by 

r 2 

R ^ "7 V, r, 

" (a + r-l)(r + l) 
and when r + a — 1 is even, the rate is upper bounded by 

< a(r 2 - r + 1) - (r - l) 2 
" (a + r-l)(r(a-l) + l)' 

Proof: The rank of an (r,a) pg-BLRC code is 

k n - rank 2 (AT) 

H= — = , (1) 

n n 

where N is the incidence matrix of the partial geometry. From 
Johnson et al. fl6) we have that the rank 2 (N) of such codes is 
upper bounded by rank 2 (H) < # + 1, and when s + 1 + 1 — a 
is even, then is is lower bounded by $ < rank 2 (H), where 

^ q= st(s + l)(t+l) 
a(t + s + 1 — a) 

Lower Bound: The minimum possible rank is achieved when 
rank 2 (N) = ■& + 1. Substituting in ^ rank 2 (N) by d + 1, 
and n by \V\ we get: 

R > -sa + s(s~ 1) = Q(a) 

- -(s+l)a + s(s + 2) + t(s + l) + l 5(a)' 

Since both numerator and denominator have negative slopes 
and Q(a) < 5(a) for all 1 < a < min{s + l,t + 1}, s > 1 
and t > 1, then the maximum possible value is achieved when 
a = 1. The lower bound is obtained by evaluating Q(l)/5(1) 
and substituting s := r and t := a — 1. 




Fig. 1: Maximum theoretical rate R achieved by pg-BLRC 
codes with repair degre r and repair alternativity a. 

Upper Bound: Similarly, the maximum possible rank is 
achieved when rank2 (N) — ■&. Substituting in ^ rank2 (N) 
by -d, and n by \V\ we get: 



R < 



= 1 



a(st-s-t-l)- ts 2 
■ a (st — s — t — 1) — ts 2 



ts 



t 2 + ts 



a {st - s - t - 1) - ts 2 - t s 



ts 



Note that we can write the divisor as a polynomial function 

Q(o) = a 2 
value at a n 



-a(st— s—t—l)—ts —t —ts, which has minimum 
, = — ~(sf — s — t — 1), which satisfies a m ; n < 1. 
Then, since the valid values of alpha are a e [min{s + l,t + 
1}] and the function Q(a) is monotonically increasing in this 
interval, the maximum rate R is obtained when a = 1. The 
upper bound is obtained by evaluating R < 1 + (t 2 + ts) /Q(l) 
and substituting s := r and t := a — 1. ■ 

Remark 1. The maximum rate of an (r,a) pg-BLRC code 
is achieved when the source partial geometry pg(s,t,a) is a 
generalized quadrangle (a = 1). 

In Figure [T] we use the bounds from Theorem [T] to depict 
the minimum and maximum^] possible theoretical rates of an 
(r, a) pg-BLRC code with a = 1 for different combinations 
of r and a values. In general it is interesting to see how the 
rate decreases when we either (i) decrease the repair degree r, 
or (ii) increase the repair alternativity a. It means that the two 
main objectives to achieve efficient repair mechanisms (small 
r values and large a values) entail an increase in the storage 
overhead of the system (low rate R), posing an optimization 
trade-off for designers of storage designers. 

D. Explicit pg-BLRC Code Constructions 

Note that there is no known construction of generalized 
quadrangles for all possible s and t values, and until now, 
only a few generalized quadrangles are known [17]. They are 
those with (s,t) e {(2, 2), (2, 4), (3, 3), (3, 9), (3, 5), (4, 4), 
(4, 6), (4, 8), (4,16)}, and those with (s,t) G {(q-l,q+l), 
(<7>9): (<2S9 2 )j (<7 2 )9 3 )} f° r an Y prime power q, and their dual 
constructions. If we evaluate all the possible (s, t) pairs and 
filter out the cases where R < 1/3 or n > 100, which are the 
rates and length interesting from a practical point of view, then 

'Limited to the cases where r + a — 1 = 1 mod 2. 



we have only six known generalized quadrangles left, namely, 

(a, t) e {(2, 2), (3, 1), (3, 3), (4, 2), (4, 4), (5, 3)}. Or in terms 
of repair degree and repair alternativity: (r, a) 6 {(2, 3), (3, 2), 
(3,4), (4,3), (4,5), (5,4)}. 

VI. Conclusions 

In this paper, we presented a new approach to design lo- 
cally repairable codes. Instead of focusing on codes achieving 
minimum repair locality and maximum rate, we analyze how 
to increase the diversity of this repair locality, thus providing 
more than one local repair alternative for data block that needs 
repair. We present an explicit construction of locally repairable 
codes that provide different trade-offs between repair locality 
and number of repair alternatives per failure. We also provide 
an upper and lower bound on the attainable rate of such codes. 
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